Intelligently Integrating Information from Speech and Vision Processing to Perform Light-weight Meeting Understanding
Abstract
Important information is often generated at meetings, but identifying and retrieving that information afterwards is not always simple. Automatically capturing such information and making it available for later retrieval has therefore become a topic of some interest. Most approaches to this problem have involved constructing specialized, instrumented meeting rooms that allow a meeting to be captured in great detail. We propose an alternative approach that focuses on people’s information retrieval needs and uses a light-weight data collection system that allows data acquisition on portable equipment, such as personal laptops. Issues that arise include the integration of information from different audio and video streams and the optimal use of scarce computing resources. This paper describes our current development of a light-weight, portable meeting recording infrastructure, as well as the use of streams of visual and audio information to derive structure from meetings. The goal is to make meeting contents easily accessible to people.
Similar resources
Auditory processing skills in brainstem level of autistic children: A Review Study
Aims: Autism is a pervasive developmental disorder. Deficits in sensory function are one of the characteristics of people with autism, who usually show abnormalities in processing and correctly interpreting auditory information. People with autism also have difficulty communicating with others. This review article deals with the accurate understanding of auditory processing skills...
Mabel: Extending Human Interaction and Robot Rescue Designs
Mabel (the Mobile Table) is a robotic system that can perform waypoint navigation, speech generation, speech recognition, natural language understanding, face finding, face following, nametag reading, and localization. Mabel can interact intelligently to give information about the conference to patrons. Major additions to this year’s design are Monte Carlo Localization, Filter-Cascade technique...
Mabel: Extending Human Interaction and Robot Rescue Design
Mabel (the Mobile Table) is a robotic system that can perform waypoint navigation, speech generation, speech recognition, natural language understanding, face finding, face following, nametag reading, and localization. Mabel can interact intelligently to give information about the conference to patrons. Major additions to this year’s design are Monte Carlo Localization, Filter-Cascade technique...
Mabel: Building a Robot Designed for Human Interaction
Mabel (the Mobile Table) is a robotic system that can perform waypoint and vision guided navigation, speech generation, speech recognition, person finding, face finding, and face following. Mabel can interact intelligently with humans in two different settings: food and information serving. The robot’s architecture is flexible and easily adaptable to other tasks such as search and rescue.
Enriching machine-mediated speech-to-speech translation using contextual information
Conventional approaches to speech-to-speech (S2S) translation typically ignore key contextual information, such as prosody, emphasis, and discourse state, in the translation process. Capturing and exploiting such contextual information is especially important in machine-mediated S2S translation as it can serve as a complementary knowledge source that can potentially aid the end users in improved unde...
Publication date: 2005